Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework

Internal identifier: 000187 (Main/Exploration); previous: 000186; next: 000188

Authors: RONG HUANG [Japan]; Palaiahnakote Shivakumara [Japan]; YAOKAI FENG [Japan]; Seiichi Uchida [Japan]

Source:

IEICE transactions on information and systems [ 0916-8532 ] ; 2013.

RBID : Pascal:13-0328463

French descriptors

Pascal (Inist)
- Reconnaissance caractère, Opérateur, Reconnaissance optique caractère, Image multiple, Méthode heuristique, Segmentation, Rapport aspect, Méthode raffinement, Marché concurrentiel, Localisation, Reconnaissance parole, Evaluation performance, Etat actuel, Vote, Reconnaissance forme.
Wicri :
- topic : Vote.

English descriptors

KwdEn :
- Aspect ratio, Character recognition, Heuristic method, Localization, Multiple image, Open market, Operator, Optical character recognition, Pattern recognition, Performance evaluation, Refinement method, Segmentation, Speech recognition, State of the art, Voting.

Abstract

To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single characters, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and are comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.
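
The abstract does not spell out how the integration module combines the weighted OCR candidates. As a rough illustration only, the sketch below assumes a plain weighted vote: each hypothesis contributes a list of (label, weight) pairs for one suspected region, and the label with the largest summed weight wins. The function name `integrate` and the data layout are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def integrate(hypotheses):
    """Combine OCR candidates from several hypotheses for one suspected region.

    `hypotheses` is a list of candidate lists; each candidate is a
    (label, weight) pair. Returns the label with the largest summed
    weight, or None when no hypothesis produced a candidate.
    This is an assumed weighted-voting scheme, not the authors' method.
    """
    scores = defaultdict(float)
    for candidates in hypotheses:
        for label, weight in candidates:
            scores[label] += weight
    if not scores:
        return None
    # Iterating labels in sorted order makes ties resolve to the
    # smallest label, so the result is reproducible.
    return max(sorted(scores), key=lambda label: scores[label])


if __name__ == "__main__":
    # Three hypothetical hypotheses for the same region.
    region = [
        [("O", 0.6), ("0", 0.4)],
        [("0", 0.7)],
        [("O", 0.3), ("Q", 0.2)],
    ]
    print(integrate(region))
```

A region for which every hypothesis returns an empty candidate list yields `None`, which a caller could treat as a pruned detection.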

Affiliations:

Links toward previous steps (curation, corpus...)

to stream PascalFrancis, to step Corpus: 000042
to stream PascalFrancis, to step Curation: 000726
to stream PascalFrancis, to step Checkpoint: 000030
to stream Main, to step Merge: 000190
to stream Main, to step Curation: 000187

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">13-0328463</idno>
<date when="2013">2013</date>
<idno type="stanalyst">PASCAL 13-0328463 INIST</idno>
<idno type="RBID">Pascal:13-0328463</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000042</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000726</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000030</idno>
<idno type="wicri:doubleKey">0916-8532:2013:Rong Huang:scene:character:detection</idno>
<idno type="wicri:Area/Main/Merge">000190</idno>
<idno type="wicri:Area/Main/Curation">000187</idno>
<idno type="wicri:Area/Main/Exploration">000187</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
<imprint><date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Aspect ratio</term>
<term>Character recognition</term>
<term>Heuristic method</term>
<term>Localization</term>
<term>Multiple image</term>
<term>Open market</term>
<term>Operator</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Refinement method</term>
<term>Segmentation</term>
<term>Speech recognition</term>
<term>State of the art</term>
<term>Voting</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Opérateur</term>
<term>Reconnaissance optique caractère</term>
<term>Image multiple</term>
<term>Méthode heuristique</term>
<term>Segmentation</term>
<term>Rapport aspect</term>
<term>Méthode raffinement</term>
<term>Marché concurrentiel</term>
<term>Localisation</term>
<term>Reconnaissance parole</term>
<term>Evaluation performance</term>
<term>Etat actuel</term>
<term>Vote</term>
<term>Reconnaissance forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Vote</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single characters, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and are comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.</div>
</front>
</TEI>
<affiliations><list><country><li>Japon</li>
</country>
<region><li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement><li>Fukuoka</li>
</settlement>
<orgName><li>Université de Kyūshū</li>
</orgName>
</list>
<tree><country name="Japon"><region name="Kyūshū"><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
</region>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
</country>
</tree>
</affiliations>
</record>
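
As displayed, the record uses the prefixes `wicri:` and `inist:` without declaring them, so a namespace-aware parser rejects it as-is. The following is a minimal sketch, assuming the record is available as a string without an XML declaration. It wraps the record in an element that binds the two prefixes to placeholder URIs, then reads the title and author names. The URIs, the `wrapper` element and the function names are assumptions made for illustration; they are not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Placeholder URIs: the record does not declare these prefixes, so the
# values below are assumptions made only to satisfy the parser.
PLACEHOLDER_NS = {
    "wicri": "urn:example:wicri",
    "inist": "urn:example:inist",
}


def parse_record(record_xml):
    """Parse a <record> string whose wicri:/inist: prefixes are undeclared."""
    decls = " ".join(
        f'xmlns:{prefix}="{uri}"' for prefix, uri in PLACEHOLDER_NS.items()
    )
    wrapped = f"<wrapper {decls}>{record_xml}</wrapper>"
    return ET.fromstring(wrapped).find("record")


def title_and_authors(record):
    """Return the article title and the author names from titleStmt."""
    title = record.findtext(".//titleStmt/title")
    authors = [name.text for name in record.findall(".//titleStmt/author/name")]
    return title, authors


if __name__ == "__main__":
    # A shortened, hypothetical record with the same shape as the one above.
    sample = (
        '<record><TEI><teiHeader><fileDesc><titleStmt>'
        '<title xml:lang="en" level="a">Example title</title>'
        '<author><name>First Author</name>'
        '<affiliation wicri:level="4"><inist:fA14 i1="01">'
        '<s1>Example University</s1></inist:fA14></affiliation></author>'
        '</titleStmt></fileDesc></teiHeader></TEI></record>'
    )
    print(title_and_authors(parse_record(sample)))
```

Restricting the search to `titleStmt` avoids returning each author twice, since the same names are repeated under `sourceDesc`.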

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000187 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000187 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:13-0328463
   |texte=   Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	OCR exploration server
	Warning: this site is under development! Warning: this site was generated automatically from raw corpora. The information has therefore not been validated.